Towards the Use of Deep Reinforcement Learning with Global Policy for Query-based Extractive Summarisation

نویسنده

Diego Mollá Aliod

چکیده

Supervised approaches for text summarisation suffer from the problem of mismatch between the target labels/scores of individual sentences and the evaluation score of the final summary. Reinforcement learning can solve this problem by providing a learning mechanism that uses the score of the final summary as a guide to determine the decisions made at the time of selection of each sentence. In this paper we present a proof-of-concept approach that applies a policy-gradient algorithm to learn a stochastic policy using an undiscounted reward. The method has been applied to a policy consisting of a simple neural network and simple features. The resulting deep reinforcement learning system is able to learn a global policy and obtain encouraging results.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Macquarie University at BioASQ 5b -- Query-based Summarisation Techniques for Selecting the Ideal Answers

Macquarie University’s contribution to the BioASQ challenge (Task 5b Phase B) focused on the use of query-based extractive summarisation techniques for the generation of the ideal answers. Four runs were submitted, with approaches ranging from a trivial system that selected the first n snippets, to the use of deep learning approaches under a regression framework. Our experiments and the ROUGE r...

متن کامل

Macquarie University at BioASQ 5b - Query-based Summarisation Techniques for Selecting the Ideal Answers

متن کامل

Macquarie University at BioASQ 5b -- Query-based Summarisation Techniques for Selecting the Ideal Answers

متن کامل

Operation Scheduling of MGs Based on Deep Reinforcement Learning Algorithm

: In this paper, the operation scheduling of Microgrids (MGs), including Distributed Energy Resources (DERs) and Energy Storage Systems (ESSs), is proposed using a Deep Reinforcement Learning (DRL) based approach. Due to the dynamic characteristic of the problem, it firstly is formulated as a Markov Decision Process (MDP). Next, Deep Deterministic Policy Gradient (DDPG) algorithm is presented t...

متن کامل

RRLUFF: Ranking function based on Reinforcement Learning using User Feedback and Web Document Features

Principal aim of a search engine is to provide the sorted results according to user’s requirements. To achieve this aim, it employs ranking methods to rank the web documents based on their significance and relevance to user query. The novelty of this paper is to provide user feedback-based ranking algorithm using reinforcement learning. The proposed algorithm is called RRLUFF, in which the rank...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2017

Towards the Use of Deep Reinforcement Learning with Global Policy for Query-based Extractive Summarisation

نویسنده

چکیده

منابع مشابه

Macquarie University at BioASQ 5b -- Query-based Summarisation Techniques for Selecting the Ideal Answers

Macquarie University at BioASQ 5b - Query-based Summarisation Techniques for Selecting the Ideal Answers

Macquarie University at BioASQ 5b -- Query-based Summarisation Techniques for Selecting the Ideal Answers

Operation Scheduling of MGs Based on Deep Reinforcement Learning Algorithm

RRLUFF: Ranking function based on Reinforcement Learning using User Feedback and Web Document Features

عنوان ژورنال:

اشتراک گذاری